In The Claims: 



1. (Cancelled). 

2. (Currently Amended) The system of claim 1 claim 8 , further comprising a 
phone dataset that includes said speech data and said transcription. 

3. (Original) The system of claim 2, wherein said phone dataset is utilized in 
training a speech recognizer. 

4. (Original) The system of claim 2, wherein said phone dataset is utilized in 
building a phonetic dictionary. 

5. (Previously Presented) The system of claim 2, wherein said transformation 
rules are applied to said phone dataset to produce said transformed phone dataset, 
said transformed phone dataset being for use in training a speech recognizer. 

6. (Cancelled). 

7. (Cancelled). 
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8. (Currently Amended) Th e syst e m of claim 7, wh e rein A system for speech 
processing, comprising: 

speech data generated from one or more speech sources; 

an enhanced phone set that includes acoustic-phonetic symbols and 

connectors for extending said enhanced phone set, said enhanced phone 
set including a TIMIT base-phone set and an extended base-phone set, 
said extended base-phone set includ e s including a base phone base 
phon e s for representing a glottal stop variation, a multipl e burst r e lease, 
a fricativ e consonant closure, a vow e l velarization, a vow e l lat e ralization, 
an R coloring, a glid e loss, an R del e tion, a labio velar fricative, and an 
articulator noise; 

a transcription generated by a transcription process that selects appropriate 
phones from said enhanced phone set to represent said speech data; 
and 

transformation rules applied to said enhanced phone set to produce a 

transformed phone dataset, said transformed phone dataset being used 
in building a phonetic dictionary . 

9. (Currently Amended) The system of claim 7 claim 8 , wherein said enhanced 
phone set includes said acoustic-phonetic symbols, said acoustic-phonetic symbols 
being utilized in said transcription process to represent acoustic-phonetic processes 
of said speech data. 

10. (Previously Presented) The system of claim 9, wherein said enhanced phone 
set further includes said connectors used in said transcription process to connect 
said acoustic-phonetic symbols to base-phones affected by acoustic-phonetic 
processes, thereby producing composite-phones. 

11. (Original) The system of claim 10, wherein said connectors indicate how and 
where said acoustic-phonetic processes affect said base-phones. 



7 



12. (Original) The system of claim 11, wherein said connectors include a 
character a >" that is placed to the left of one of said base-phones to indicate that one 
of said acoustic-phonetic processes affects a beginning of said one of said base- 
phones. 

13. (Original) The system of claim 12, wherein said character is placed to the 
left of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes affects a beginning of said one of said composite-phones. 

14. (Original) The system of claim 11, wherein said connectors include a 
character that is placed to the right of one of said base-phones to indicate that 
one of said acoustic-phonetic processes affects an ending of said one of said base- 
phones. 

15. (Original) The system of claim 14, wherein said character is placed to the 
right of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes affects an ending of said one of said composite-phones. 

16. (Original) The system of claim 11, wherein said connectors include a 
character that is placed to the right of one of said base-phones to indicate that 
one of said acoustic-phonetic processes affects an entirety of said one of said base- 
phones. 

17. (Original) The system of claim 16, wherein said character is placed to the 
right of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes affects an entirety of said one of said composite-phones. 
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18. (Original) The system of claim 11, wherein said connectors include a 
character " A " that is placed to the right of one of said base-phones to indicate that 
one of said acoustic-phonetic processes occurred completely within said one of said 
base-phones. 

19. (Original) The system of claim 18, wherein said character " A " is placed to the 
right of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes occurred completely within said one of said composite-phones. 

20. (Currently Amended) Th e syst e m of claim 9, wherein A system for speech 
processing, comprising: 

speech data generated from one or more speech sources; 

an enhanced phone set that includes acoustic-phonetic symbols and 

connectors for extending said enhanced phone set, said enhanced phone 
set including a TIMIT base-phone set and an extended base-phone set; 

a transcription generated by a transcription process that selects appropriate 
phones from said enhanced phone set to represent said speech data, 
said acoustic-phonetic symbols being utilized in said transcription 
process to represent acoustic-phonetic processes of said speech data, 
said acoustic-phonetic cont e nt processes represented by said acoustic- 
phonetic symbols includes including a nasalization, a glottalization 
varianc e , a br e athiness, a labialization, a palatalization, a voicing, a 
d e voicing, a voic e d frication, a low fr e qu e ncy voic e l e ss frication, a high 
frequ e ncy voic e l e ss frication, an epenthetic vowel , a murmur, an air 
puff, a burst quality, an approximation, an abs e nce of a burst/ r e lease, 
and a tongu e clic k ; and 

transformation rules applied to said enhanced phone set to produce a 

transformed phone dataset, said transformed phone dataset being used 
in building a phonetic dictionary . 
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21. (Original) The system of claim 5, wherein said transformation rules include 
merge-type transformation rules that combine two adjacent phones in said phone 
dataset into a single phone selected from said enhanced phone set. 

22. (Original) The system of claim 5, wherein said transformation rules include 
split-type transformation rules that separate one phone in said phone dataset into 
two different phones selected from said enhanced phone set. 

23. (Original) The system of claim 5, wherein said transformation rules include 
replace-type transformation rules that replace one phone in said phone dataset with 
a different phone selected from said enhanced phone set. 

24. (Original) The system of claim 5, wherein said transformation rules include 
change in context-type transformation rules that change one phone in said 
phone dataset to a different phone selected from said enhanced phone set 
depending on context. 

25. (Cancelled). 

26. (Currently Amended) The method of claim 25 claim 32 , further comprising the 
step of combining said speech data and said transcription to produce a phone 
dataset. 

27. (Original) The method of claim 26, wherein said phone dataset is utilized in 
training a speech recognizer. 

28. (Original) The method of claim 26, wherein said phone dataset is utilized in 
building a phonetic dictionary. 
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29. (Previously Presented) The method of claim 26, further comprising the step 
of applying said transformation rules to said phone dataset to produce said 
transformed phone dataset, said transformed phone dataset being for use in 
training a speech recognizer. 

30. (Cancelled). 

31. (Cancelled). 

32. (Currently Amended) Th e method of claim 31, wh e rein A method for speech 
processing, comprising: 

generating speech data from one or more speech sources; 

providing an enhanced phone set that includes acoustic-phonetic symbols and 
connectors for extending said enhanced phone set, said enhanced phone 
set including a TIMIT base-phone set and an extended base-phone set, 
said extended base-phone set includes including a base phone base 
phones for representing a glottal stop variation, a multiple burst release, 
a fricative consonant closur e , a vowel velarization, a vowel lateralization, 
an R - coloring, a glid e loss, an R d e letion, a labio v e lar friacative, and an 
articulator noise; 

producing a transcription using a transcription process that selects 

appropriate phones from said enhanced phone set to represent said 
speech data; and 

applying transformation rules to said enhanced phone set to produce a 
transformed phone dataset, said transformed phone dataset being 
used in building a phonetic dictionary . 
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33. (Currently Amended) The method of claim 31 claim 32 , wherein said 
enhanced phone set includes said acoustic-phonetic symbols, said acoustic-phonetic 
symbols being utilized in said transcription process to represent acoustic-phonetic 
processes of said speech data. 

34. (Previously Presented) The method of claim 33, wherein said enhanced 
phone set further includes said connectors used in said transcription process to 
connect said acoustic-phonetic symbols to base-phones affected by acoustic- 
phonetic processes, thereby producing composite-phones. 

35. (Original) The method of claim 34, wherein said connectors indicate how and 
where said acoustic-phonetic processes affect said base-phones. 

36. (Original) The method of claim 35, wherein said connectors include a 
character that is placed to the left of one of said base-phones to indicate that one 
of said acoustic-phonetic processes affects a beginning of said one of said base- 
phones. 

37. (Original) The method of claim 36, wherein said character is placed to the 
left of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes affects a beginning of said one of said composite-phones. 

38. (Original) The method of claim 35, wherein said connectors include a 
character that is placed to the right of one of said base-phones to indicate that 
one of said acoustic-phonetic processes affects an ending of said one of said base- 
phones. 

39. (Original) The method of claim 38, wherein said character is placed to the 
right of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes affects an ending of said one of said composite-phones. 
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40. (Original) The method of claim 35, wherein said connectors include a 
character that is placed to the right of one of said base-phones to indicate that 
one of said acoustic-phonetic processes affects an entirety of said one of said base- 
phones. 

41. (Original) The method of claim 40, wherein said character is placed to the 
right of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes affects an entirety of said one of said composite-phones. 

42. (Original) The method of claim 35, wherein said connectors include a 
character " A " that is placed to the right of one of said base-phones to indicate that 
one of said acoustic-phonetic processes occurred completely within said one of said 
base-phones. 

43. (Original) The method of claim 42, wherein said character " A " is placed to the 
right of one of said composite-phones to indicate that one of said acoustic-phonetic 
processes occurred completely within said one of said composite-phones. 
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44. (Currently Amended) Th e m e thod of claim 33, wh e r e in A method for speech 
processing, comprising: 

generating speech data from one or more speech sources; 

providing an enhanced phone set that includes acoustic-phonetic symbols and 
connectors for extending said enhanced phone set, said enhanced phone 
set including a TIMIT base-phone set and an extended base-phone set; 

producing a transcription using a transcription process that selects 

appropriate phones from said enhanced phone set to represent said 
speech data, said acoustic-phonetic symbols being utilized in said 
transcription process to represent acoustic-phonetic processes of said 
speech data, said acoustic-phonetic content processes represented by 
said acoustic-phonetic symbols includes including a nasalization, a 
glottalization variance, a breathiness, a labialization, a palatalization, a 
voicing, a devoicing, a voiced frication, a low frequency voiceless 
frication, a high frequency voiceless frication, an epenthetic vowel^-a 
murmur, an air puff, a burst quality, an approximation, an absence of a 
burst /release, and a tongue click ; and 

applying transformation rules to said enhanced phone set to produce a 
transformed phone dataset, said transformed phone dataset being 
used in building a phonetic dictionary . 

45. (Original) The method of claim 29, wherein said transformation rules include 
merge-type transformation rules that combine two adjacent phones in said phone 
dataset into a single phone selected from said enhanced phone set. 

46. (Original) The method of claim 29, wherein said transformation rules include 
split-type transformation rules that separate one phone in said phone dataset into 
two different phones selected from said enhanced phone set. 
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47. (Original) The method of claim 29, wherein said transformation rules include 
replace-type transformation rules that replace one phone in said phone dataset with 
a different phone selected from said enhanced phone set. 

48. (Original) The method of claim 29, wherein said transformation rules include 
change in context-type transformation rules that change one phone in said phone 
dataset to a different phone selected from said enhanced phone set depending on 
context. 



49. (Cancelled). 

50. (Cancelled). 

51. (Cancelled). 
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52. (Currently Amended) Th e m e thod of claim 5 1 , wh e r e in A method for speech 

processing, comprising: 

providing an enhanced phone set that includes enhanced base-phones for 
representing input speech data, acoustic-phonetic symbols that 
represent acoustic-phonetic content of said input speech data, and 
connectors for extending said enhanced base-phones by selectively 
connecting said acoustic-phonetic symbols to said enhanced base- 
phones to create composite enhanced phones, said enhanced base- 
phones include including glottal stop variation phones qq, qh, qcl, qclq, 
qqcl, hqq, and hqh, multiple burst releas e s pp, tt, kk, bb, dd, gg, jhj, 
and chch, fricative consonant closures fcl, thcl, scl, shcl, vcl, dhcl, zcl, 
and zhcl, vowel velarizations/ lateralizations al, ol, and ul, R colorings or, 
ar, and ixr, glide losses ee and oo, R deletions ax_, e_, ix_, ri, and ra, a 
labio velar friacative hw, and articulator noises 1#, b#, hh# ? w#, g#, ly#, 
11#, and lq#; 

producing a detailed transcription of said input speech data using a 
transcription process that selects appropriate phones from said 
enhanced phone set to represent said input speech data; and 

applying transformation rules to said transcription to produce a 

transformed transcription, said transformed transcription being used 
to create a phonetic dictionary for a speech recognition process . 
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53. (Currently Amended) The method of claim 51 claim 54 , wherein said 
connectors include a character that is placed left of one of said enhanced base- 
phones to indicate that one of said acoustic-phonetic processes affects a beginning of 
said one of said enhanced base-phones, said connectors including a character 
that is placed right of said one of said base-phones to indicate that said one of said 
acoustic-phonetic processes affects an ending of said one of said enhanced base- 
phones, said connectors including a character that is placed right of said one of 
said base-phones to indicate that said one of said acoustic-phonetic processes 
affects an entirety of said one of said enhanced base-phones, said connectors 
include a character " A " that is placed right of said one of said base-phones to 
indicate that said one of said acoustic-phonetic processes occurred completely 
within said one of said enhanced base-phones. 
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54. (Currently Amended) The m e thod of claim 5 1 , wh e r e in A method for speech 
processing, comprising: 

providing an enhanced phone set that includes enhanced base-phones for 
representing input speech data, acoustic-phonetic symbols that 
represent acoustic-phonetic content of said input speech data, and 
connectors for extending said enhanced base-phones by selectively 
connecting said acoustic-phonetic symbols to said enhanced base- 
phones to create composite enhanced phones; 

producing a detailed transcription of said input speech data using a 
transcription process that selects appropriate phones from said 
enhanced phone set to represent said input speech data, said acoustic- 
phonetic content processes represented by said acoustic-phonetic 
symbols includ e s including a nasalization, a plurality of glottalization 
varianc e s, a br e athin e ss, a labialization, a palatalization, a voicing, a 
devoicing, a voic e d frication, a low fr e quency voiceless frication, a high 
fr e qu e ncy voic e l e ss frication, an epenthetic vowel , a murmur, an air 
puff, a burst quality, an approximation, an absenc e of a burst/ release, 
and a tongu e clic k ; and 

applying transformation rules to said transcription to produce a 

transformed transcription, said transformed transcription being used 
to create a phonetic dictionary for a speech recognition process . 
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55. (Currently Amended) The method of claim 51 claim 54 , wherein said 
transformation rules include merge-type transformation rules that combine two 
adjacent phones in said detailed transcription into a single phone selected from said 
enhanced phone set, said transformation rules including split-type transformation 
rules that separate a first phone in said detailed transcription into two different 
phones selected from said enhanced phone set, said transformation rules including 
replace-type transformation rules that replace a second phone in said detailed 
transcription with a first different phone selected from said enhanced phone set, said 
transformation rules including change-in-context transformation rules that change a 
third phone in said detailed transcription to a second different phone selected from 
said enhanced phone set depending on a surrounding context of said input speech 
data. 

56. (Currently Amended) The method of claim 5 1 claim 54 wherein at least one of 
said composite enhanced phones includes a plurality of said connectors that extend 
a corresponding one of said enhanced base phones with a plurality of said acoustic- 
phonetic symbols. 
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57. (Currently Amended) Th e m e thod of claim 5 1 wh e r e in A method for speech 
processing, comprising: 

providing an enhanced phone set that includes enhanced base-phones for 
representing input speech data, acoustic-phonetic symbols that 
represent acoustic-phonetic content of said input speech data, and 
connectors for extending said enhanced base-phones by selectively 
connecting said acoustic-phonetic symbols to said enhanced base- 
phones to create composite enhanced phones; 

producing a detailed transcription of said input speech data using a 
transcription process that selects appropriate phones from said 
enhanced phone set to represent said input speech data; and 

applying transformation rules to said transcription to produce a 

transformed transcription, said transformed transcription being used 
to create a phonetic dictionary for a speech recognition process, said 
transformation rules includ e including a first merge rule that merges 
an original phone bcl and an original phone b into a merged phone 
b, said transformation rules including a second merge rule that 
merges an original phone tcl and an original phone t into a merged 
phone t, said transformation rules including a third merge rule that 
merges an original phone kcl and an original phone k into a merged 
phone k. 
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58. (Currently Amended) Th e m e thod of claim 5 1 wher e in A method for speech 
processing, comprising: 

providing an enhanced phone set that includes enhanced base-phones for 
representing input speech data, acoustic-phonetic symbols that 
represent acoustic-phonetic content of said input speech data, and 
connectors for extending said enhanced base-phones by selectively 
connecting said acoustic-phonetic symbols to said enhanced base- 
phones to create composite enhanced phones; 

producing a detailed transcription of said input speech data using a 
transcription process that selects appropriate phones from said 
enhanced phone set to represent said input speech data; and 

ap plying transformation rules to said transcription to produce a transformed 
transcription, said transformed transcription being used to create a 
phonetic dictionary for a speech recognition process, said transformation 
rules include including a first split rule that splits an original phone em 
into a first split phone ah and a second split phone m, said 
transformation rules including a second split rule that splits an original 
phone or into a first split phone ao and a second split phone r, said 
transformation rules including a third split rule that splits an original 
phone al into a first split phone aa and a second split phone 1, said 
transformation rules including a fourth split rule that splits an original 
phone aa=n into a first split phone aa and a second split phone n. 
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59. (Currently Amended) Th e m e thod of claim 5 1 wheroin A method for speech 
processing, comprising: 

providing an enhanced phone set that includes enhanced base-phones for 
representing input speech data, acoustic-phonetic symbols that 
represent acoustic-phonetic content of said input speech data, and 
connectors for extending said enhanced base-phones by selectively 
connecting said acoustic-phonetic symbols to said enhanced base- 
phones to create composite enhanced phones; 

producing a detailed transcription of said input speech data using a 
transcription process that selects appropriate phones from said 
enhanced phone set to represent said input speech data; and 

applying transformation rules to said transcription to produce a transformed 
transcription, said transformed transcription being used to create a 
phonetic dictionary for a speech recognition process, said transformation 
rules includ e including a first replace rule that replaces an original 
phone gg with a replacement phone g, said transformation rules 
including a second replace rule that replaces an original phone qclq with 
a replacement phone q, said transformation rules include a third replace 
rule that replaces an original phone p=v with a replacement phone b. 
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60. . (Currently Amended) Th e m e thod of claim 5 1 wh e r e in A method for speech 
processing, comprising: 

providing an enhanced phone set that includes enhanced base-phones for 
representing input speech data, acoustic-phonetic symbols that 
represent acoustic-phonetic content of said input speech data, and 
connectors for extending said enhanced base-phones by selectively 
connecting said acoustic-phonetic symbols to said enhanced base- 
phones to create composite enhanced phones; 

producing a detailed transcription of said input speech data using a 
transcription process that selects appropriate phones from said 
enhanced phone set to represent said input speech data; and 

applying transformation rules to said transcription to produce a transformed 
transcription, said transformed transcription being used to create a 
phonetic dictionary for a speech recognition process, said transformation 
rules include including a change-in-context rule that replaces an 
original phone aa=n with a changed-context phone aa < n m ng. 
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